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QO ' First evidence for single top quark production has recently been found by both the 

| DO and CDF experiments. By combining three analyses, DO measured a cross section of 

a (pp — > tb + X, tqb + X) = 4.7 ±1.3 pb, with a significance of 3.6 standard deviations, and 
• CDF's matrix elements analysis measured a cross section of a (pp — ► tb + X, tqb + X) = 

3.0l^'^pb, with a significance of 3.1 standard deviations. These analyses also provided the 
\ first direct measurements of the CKM matrix element, | V* & | , without assuming unitarity. 

i—,, This talk briefly describes the latest single top production results from the two experiments. 
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10 ; I. INTRODUCTION 
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The top quark was discovered in 1995 by the DO and CDF collaborations, produced in pairs 

<n ; nn 

by the strong force l|, However, the top quark can also be produced singly, by the electroweak 

00 : 

force, providing a powerful probe of charged-current interactions involving the top quark. The 

£> ' 

. _ ' two categories of single top processes that are significant at the Fermilab Tevatron Collider are 

X • 

s-channel, which involve the exchange of a time-like W boson, and i-channel, which involve the 
exchange of a space-like W boson. Representative leading order (LO) Feynman diagrams are 
given in Fig. [TJ For proton-antiproton collisions with y/s = 1.96 TeV, using a top quark mass of 
175 GeV, the theoretical cross section at next to leading order (NLO) for s-channel production is 
a s = 0.88 ± 0.11 pb, and for t-channel production, at = 1.98 ± 0.25 pb [3]. The given cross sections 
are for the sum of top and antitop production. In addition to s-channel and t-channel production, 
a single top quark can also be created in association with an an on-shell W boson, but this process 
is negligible at the Fermilab Tevatron Collider. 

One of the main reasons to measure the single top production cross section is because it is 



'Electronic address: jmitrevsOscipp. ucsc.edu 



2 



u t u d 




FIG. 1: LO Feynman diagrams for single top production via the (a) s-channel and (b) i-channel process. 



proportional to |V#,| ; thus measuring it is a way to directly measure the CKM matrix element 
\Vtb\ without assuming three generations or unitarity. Additionally, single top quark production is 
sensitive to physics beyond the standard model [4]. Lastly, single top production is a background 
to Higgs and other beyond the standard model searches. 

This talk summarizes the latest single top quark searches performed by the DO and CDF ex- 
periments. The DO results are based the analysis published in Ref. [5J], using 0.9 fb" 1 of data, with 
some updates to the techniques. The CDF results are based on 1.5 fb -1 of data. 



II. EVENT SELECTION 



According to the standard model, the top quark decays almost exclusively into a 6-quark and 
a W boson. Figure Q] shows that the s-channel single top quark process has a second 6-quark 
in the final state while the t-channel single top quark process has a light spectator quark. For 
i-channel processes, the initial state 6-quark comes from gluon splitting, so sometimes another b- 
quark, generally with low transverse momentum, is found in the final state. That is why t-channel 
production is sometimes labeled tqb. Additional jets can come from radiated gluons. For events in 
which the W boson decays into either an electron or a muon plus a neutrino, the main backgrounds 
are VF+jets, including VF+heavy flavor, tt, and QCD multijet events where one jet is misidentified 
as a lepton or a lepton strays from a jet. Diboson and Z+jets events form a smaller background. 

The selected events were required to have one reconstructed charged electron or muon, from 
the neutrino, and at least two jets. CDF required exactly two jets, while DO used events with two, 
three, and in some analyses four jets. At least one jet had to be identified as coming from a 6-quark 
by lifetime tagging. Additional cuts were applied to improve the purity of the sample by reducing 
the fake lepton background, either by cutting on the difference in azimuthal angle between the -^t 
and the jets or leptons, or on the transverse mass of the reconstructed W boson, or on the ft? 
significance. The events were triggered by a lepton+jets trigger at DO or a single lepton trigger at 
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TABLE I: Predicted event yield at CDF with 1.5 fb -1 and at DO with 0.9 fb -1 for electrons and muons 
combined and at least one 6-tagged jet. In the DO analyses the diboson and Z+jets backgrounds are not 
separately modeled but are instead included in the VF+jets normalization. 



Source 


CDF Event Yields 
2 jets 


2 jets 


T"\/~\ T~\ 1 "XT'* 1 1 

DO Event Yields 
3 jets 


4 jets 


s-channel 


23.9 ± 6.1 


16 ± 3 


8 ± 2 


2 ± 1 


i-cnannel 


OTA 1 C A 

37 .0 ± 5.4 


on i a 

20 ± 4 


1 O 1 o 

12 ± 3 


A 1 1 

4 ± 1 


tt — > U 




39 ± 9 


32 ± 7 


11 ± 3 




85.3 ± 17.8 








tt — > ^+jets 




20 ± 5 


103 ±25 


143 ± 33 


Diboson 


40.7 ±4.0 








Z+jets 


13.8 ±2.0 








W+bottom 


319.6 ± 112.3 


261 ±55 


120 ±24 


35 ±7 


W ±charm a 


324.2 ± 115.8 


151 ±31 


85 ± 17 


23 ±5 


IT+light jets 6 


214.6 ±27.3 


119 ±25 


43 ±9 


12 ±2 


Multijets 


44.5 ± 17.8 


95 ± 19 


77 ± 15 


29 ±6 


Total background 


1042.8 ±218.2 


686 ±41 


460 ± 39 


253 ± 38 


Observed 


1078 


697 


455 


246 



"only W + cc in DO 
'contains W + c in DO 



CDF. The CDF triggers had a requirement for high pseudorapidity electrons. Table [I] shows 
the resulting yields for the CDF and DO analyses. 



III. MULTIVARIATE ANALYSES 



In order to extract the signal from the much bigger background, multivariate analyses were 
used. Such techniques generally build a discriminant of the type: 

p(x|5)+p(x|5) [L) 
where p(x\S) is the probability density to observe the given configuration (x) of jets and leptons 
given that the event is signal, and p(x\B) is the corresponding probability density given that the 
event is background. 

Both DO and CDF performed a single top search using the matrix elements technique, which 
calculates p{x.\S) and p(x\B) using 

p(x|processJ = — (2) 
Oi ax 
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D0 Run II Preliminary 



L = 0.9 fb" 1 



CDF Run II Preliminary, L=1 .51 fb 
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FIG. 2: Discriminant output of the ME analyses for DO (a) and CDF (b). 
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where <7j is the cross section of a given process, with numerical (usually Monte Carlo) integration. 
Both used MadGraph LO matrix elements for the calculation 6[|. One complication is that the 
event configuration, x, refers to reconstructed objects, such as jets, while the matrix element uses 
the parton-level event configuration, represented by y. The parton-level values are summed over: 

d(Ths,j 



da 

d: 



/i,i(y,Q')/2j(y,Q 2 



dy 



(y)VF(x|y,j)ep art0 n(y) 



(3) 



where dats/dy is the differential cross section for the hard scatter collision, / represent the parton 
distribution functions for the proton and antiproton, W the transfer function relating the proba- 
bility to observe event configuration x given event configuration y, and O represents parton-level 
cuts. The summation is over the various ways to assign jets to partons. 

DO's current matrix elements results are an update to those published in Ref. [5] , with the pri- 
mary change being the addition of a ti matrix element to the three-jet discriminants. Nevertheless, 
the main sensitivity remains in the two-jet events. Figure. E)^a) shows the output for the t-channel 
discriminant for two-jet events used by DO, normalized to the measured cross section, and Fig[2]^b) 
show the output of the combined discriminant from CDF's analysis, normalized similarly. 

DOperformed an analysis using boosted decision trees, which was the main analysis described in 
Ref. [5||. It has not been updated, but it nevertheless remains the one with the greatest measured 
significance. Decision trees work by choosing a sequence of cuts, but unlike regular cuts-based 
analyses, they look on both sides of a cut, adding additional cuts until an end criteria is satisfied. 
The sequence of cuts forms a binary tree, and each leaf has a purity (p) associated with it, describing 
what fraction of the leaf content is signal. For a given event, -D(x) from Eq. [I] is approximated 
as -D(x) = p x , where p x is the purity of the leaf where the event is assigned. Boosting improves 
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the discriminating power by averaging over a number of trees, each tree being modified in such a 
way as to improve the classification of misclassified events, which are signal events that fall in a 
background-like leaves or background events that fall in a signal-like leaves. Such events are given 
a boost in their weight, thereby making the tree-growing algorithm give them preference. 

DO used 49 variables as inputs to the decision trees, which can be classified in three categories: 
individual object kinematics, global event kinematics, and angular correlations. Some variables 
with high discriminating power are the invariant mass of all the jets in the event, the invariant 
reconstructed top quark mass, the angle between the highest-px ^-tagged jet and the lepton in the 
rest frame of the reconstructed top quark, and the lepton charge times the pseudorapidity of the 
untagged jet (Q X 77). 

CDF performed an analysis using a likelihood discriminant. For the t-channel discriminant, it 
used the scalar sum of the jets, charged lepton, and ft? (Ht)] the cosine of the angle between 
the lepton and the spectator jet; Q x rj; the dijet invariant mass; the log of the t-channel matrix 
element; the output of a neural network jet flavor separator; and the x 2 °f the kinematic solver, 
which reconstructs the top quark and the W boson. The s-channel discriminant was built similarly. 

DO performed an analysis using a Bayesian neural network. This analysis has been updated 
from what was published in Ref. [sj], with much improved significance. Unlike a regular neural 
network which uses one set of network parameters (wo) determined by the training process to 
approximate D(x) = n(x, wq), a Bayesian neural network integrates over all possible weights: 



where p(vs\T) represents the probability density to have training weights w given training data T. 
Practically, the algorithm is implemented by averaging over a number of individual neural networks, 
so the Bayesian aspect can be viewed as an analogue to the boosting used for the decision trees. 

The Bayesian neural network analysis used between 18 and 25 variables in each analysis channel. 
As an example, the variables with highest discriminating power in the electron, two jet, one 6-tagged 
jet channel are the invariant mass of the leading two jets, the invariant reconstructed top quark 
mass, Q x r/, and the pt of the leading two jets summed vectorially. 




(4) 



IV. RESULTS 



The measured cross sections for the three analyses performed by DO, along with their combina- 
tion, and the two analyses performed by CDF, are shown in Table HU The combination was done 
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TABLE II: The measured cross sections. The s-channel + i-channel cross section measurements assume a 
standard model cross section ratio of <J s /&t — 0.44. 



Analysis 


s-channel + i-channel 


Cross Section (pb) 
s-channel 


i-channel 


Boosted decision trees 


4.9 ± 1.4 


1.0 ±0.9 


4 2+ 1 - 8 


Matrix elements (DO) 








Bayesian NN 


4.4±i;« 






DO Combined 


4.7 ± 1.3 






Matrix elements (CDF) 


3.oil:? 


1 1+ 10 

I - i -0.8 


i q+i-o 


Likelihood 


2.711;? 


1 1+ 1 ' 4 
J-- 1 - 1.1 


J-^-i.o 



TABLE III: The expected and measured significances for the various analyses. 



Analysis 


p- value 


Expected 

Significance (std. dev.) 


p- value 


Measured 

Significance (std. dev.) 


Boosted decision trees 


0.0177 


2.1 


0.00037 


3.4 


Matrix elements (DO) 


0.0307 


1.9 


0.00082 


3.2 


Bayesian NN 


0.0155 


2.2 


0.00083 


3.1 


DO Combined 


0.0105 


2.3 


0.00014 


3.6 


Matrix elements (CDF) 


0.0013 


3.0 


0.0009 


3.1 


Likelihood 


0.0020 


2.9 


0.0031 


2.7 



using the Best Linear Unbiased Estimate (BLUE) method [7|. The boosted decision trees analysis, 
and both analyses at CDF also measured separate s-channel and i-channel cross sections, which 
are also given in the table. The expected and measured significances of the analyses are given in 
Table DTD 

Both DO and CDF measured \ Vtb\ within the single top analyses. These measurements make no 
assumptions on the unitarity of CKM or the number of families, but they do require a few assump- 
tions. The first assumption is that the observed single top quarks were produced an interaction 
with a W boson. The second assumption is that \Vtb\ 2 S> \Vtd\ 2 + |^4s| 2 > which is experimentally 
supported by the B(t — > Wb)/B(t — > Wq) measurements done on tt events {si, [J. A third assump- 
tion is that the Wtb vertex is CP-conserving and of the V— A form, though possibly of anomalous 
strength, which is parametrized by a multiplicative constant, , equal to one in the standard 
model. Additional theoretical uncertainties were added to the measurements of \ Vtb\ Q]- 

On DO the \Vtb\ measurement was done in the scope of the decision trees analysis and was 
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published in Ref. [5]. The final results are: 

• l^ib/fi = l-31^02i, if l^tb/i'l is n °t constrained to be between zero and one, 

• \Vtb\ > 0.68 at 95% C.L., assuming that \Vtb\ is between zero and one and = 1. 

On CDF, both the matrix elements analysis and the likelihood analysis performed \Vtb\ mea- 
surements. The results without being constrained to being between zero and one are: 

• f^tb/f = 1-02 ± 0.18(expt) ± 0.07(theory) for the matrix elements analysis, 

• \V tb f[\ = 0.97±£^(expt) ± 0.07(theory) for the likelihood analysis. 
Assuming that \Vtb\ is between zero and one and f[ = 1, 

• l^tbl > 0.55 at 95% C.L for the matrix elements analysis, 

• \Vtb\ > 0.52 at 95% C.L for the likelihood analysis. 

V. SUMMARY AND CONCLUSION 

DO and CDF have both found evidence for single top production. Combining the three analyses 
at DO resulted in a cross section measurement of a (pp — ► tb + X, tqb + X) = 4.7 ± 1.3 pb, with a 
significance of 3.6 standard deviations. With the matrix elements analysis, CDF measured a cross 
section of a {pp -^tb + X, tqb + X) = 3.0^'^pb, with a significance of 3.1 standard deviations. 
The first direct measurements of \ Vt b \ were performed without assuming three generational unitar- 
ity, limiting \Vtb\ > 0.68 at 95% C.L using the DO decision tree analysis, and \Vtb\ > 0.55 at 95% 
C.L using the CDF matrix elements analysis. 
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